AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
General Visual Representation

# General Visual Representation

Sam2 Hiera Small.fb R896 2pt1
Apache-2.0
SAM2 weights (HieraDet image encoder only) based on the timm library, derived from Facebook's Hiera small model.
Image Segmentation Transformers
S
timm
67
0
CLIP ViT B 32 CommonPool.M.text S128m B4k
MIT
A vision-language model based on the CLIP architecture, supporting zero-shot image classification tasks
Text-to-Image
C
laion
68
0
CLIP ViT B 32 CommonPool.M S128m B4k
MIT
Zero-shot image classification model based on CLIP architecture, supporting general vision-language tasks
Text-to-Image
C
laion
79
0
CLIP ViT B 32 CommonPool.S.basic S13m B4k
MIT
A vision-language model based on the CLIP architecture, supporting zero-shot image classification tasks
Image-to-Text
C
laion
53
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase